Analysis of approaches for supporting the Open Provenance Model: A case study of the Trident workflow workbench
Authors
Microsoft Research, Redmond WA 98052
Abstract
The Trident workbench is a platform for composing, executing and managing scientific workflows. While Trident collects provenance in its native provenance model, the Third Provenance Challenge was an opportunity to build support for the Open Provenance Model (OPM) into Trident. There are several possible approaches to harmonizing our native model with OPM, and the same choices are available to other existing provenance and workflow systems working towards OPM compatibility. We identify and analyze the relative merits of these approaches to inform practitioners planning to support OPM in their existing provenance and workflow systems. Further, we describe our experience using the integration approach we chose to interoperate with other teams as part of the challenge.
Similar resources
Development of complex scientific workflows: towards end-to-end workflows
The analysis of water planning options on environmental assets relies on combining mathematical models from several disciplines. The growing complexity of these modelling tasks increases the potential for mistakes and misinforming stakeholders and the public. Through better capture of provenance information (audit trails), scientific workflow tools improve the transparency of model interactions...
Provenance for Scientific Workflows Towards Reproducible Research
eScience has established itself as a key pillar in scientific discovery, continuing the evolution of the scientific discovery process from theoretical to empirical to computational science [13]. Extensive deployment of instruments and sensors that observe the physical and biological world is bringing large and diverse data within the reach of scientists. Often, that data is more frequently shar...
Editorial: Scientific Workflows, Provenance and Their Applications
Scientific workflows play a crucial role in modern eScience [5] where many significant scientific discoveries are achieved through complex and distributed computations. For many scientists in the Life Sciences, in bioinformatics, geosciences, chemistry, physics, and numerous other domains, scientific workflows have become an enabling technology to formalize and automate complex and data intensi...
Understanding Collaborative Studies through Interoperable Workflow Provenance
The provenance of a data product contains information about how the product was derived, and is crucial for enabling scientists to easily understand, reproduce, and verify scientific results. Currently, most provenance models are designed to capture the provenance related to a single run, and mostly executed by a single user. However, a scientific discovery is often the result of methodical exe...
Accuracy evaluation of different statistical and geostatistical censored data imputation approaches (Case study: Sari Gunay gold deposit)
Most of the geochemical datasets include missing data with different portions and this may cause a significant problem in geostatistical modeling or multivariate analysis of the data. Therefore, it is common to impute the missing data in most of geochemical studies. In this study, three approaches called half detection (HD), multiple imputation (MI), and the cosimulation based on Markov model 2...
Journal title:
- Future Generation Comp. Syst.
Volume 27
Publication date: 2011